Introduction


• Failure prognosis and prediction of the future operating state are important for ensuring continued operation and for condition-based maintenance.

• Most work on machines focuses on diagnosis rather than prognosis, but the time to failure, or remaining useful life, is also important.

• Prognosis generally needs large historical data sets to extract fault progression trends, and these are unavailable in most cases.

The problem under study

• A method for prognosis of electrical failures in a PM synchronous motor is presented.

• Transient increased-contact-resistance faults are investigated. Similar results hold for turn-to-turn and turn-to-frame faults.

• Objective: determine the probability of failure at the next step.


Methodology 

• Fault prognosis is the next step after diagnosis, and diagnosis information forms the basis of prognosis techniques.

• The q-axis current is used to extract information about faults.
• Features of the fault characteristics are extracted from time-frequency distributions.
• Diagnosis: Linear Discriminant Classifier (LDC)

► Training of the classifier for discrete fault states

► Classification of test samples

• Prognosis: Hidden Markov Model (HMM)

► Training/defining the HMM model parameters

* Calculating the state-dependent observation probability (B)

* Calculating the state transition probabilities (A)

* Defining the initial state probabilities (π)

► Prediction of the failure state probability
Type of fault and experimental setup


• Samples were time-frequency features of the current $i_q$.

• Artificial faults were imposed. These are transient faults representing increased contact resistance, of fixed value and fixed duration.

• A fault is identified by recognizing both inception and clearing, but only the inception transient is used to determine fault severity.


Figure: Sampled $i_q$ current
Machine and fault characteristics


• Specification of the test machine:

No. of poles         6
Construction type    Surface-mounted PMAC machine
Rated voltage        12 V
Usage                Automotive application
Rated power          1 HP
No-load speed        3000 rpm

• A custom FPGA-based I/O board was used as the drive controller. Its response was much slower than the fault dynamics.

• The sampling frequency was 16.67 kHz.
Application of Proposed Method


• The fault studied was an intermittently increased contact resistance, a transient electrical fault in the winding of the PMSM machine.

• The fault was created in the lab by inserting a resistor in series with a terminal in the winding to simulate a momentary breaking of the contact.

• The severity of the fault was primarily defined by the value of the inserted resistance.

• Resistances of 2.14 pu, 2.80 pu, 4.03 pu, 6.33 pu, and 15.84 pu were inserted to mimic a progressively worsening fault.

• The transient faults have two stages, inception and removal. Both can be classified as separate events by analysis of the time-frequency features.
• In this work, the fault features were extracted from the inception event.
Feature Extraction - Three methods

Un-decimated  Wavelet  Transform  (UDWT) 


• UDWT has greater flexibility than the STFT
• Different basis functions can be used
• Good time resolution and high frequency resolution

• Tiling is variable

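Wigner-Ville Distribution

Defined as

$$W(t, \omega) = \int_{-\infty}^{\infty} s\!\left(t + \frac{\tau}{2}\right) s^{*}\!\left(t - \frac{\tau}{2}\right) e^{-j\omega\tau}\, d\tau$$

where $s$ is the input signal, $t$ is the time, and $\omega$ is the frequency
• High time-frequency resolution
• No tradeoff between time and frequency resolution

• The major shortcoming is the cross-terms that appear for multi-component signals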

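Choi-Williams Transform


Defined as

$$C(t, \omega) = \int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty} \sqrt{\frac{\sigma}{4\pi\tau^{2}}}\, \exp\!\left(-\frac{\sigma (u - t)^{2}}{4\tau^{2}}\right) s\!\left(u + \frac{\tau}{2}\right) s^{*}\!\left(u - \frac{\tau}{2}\right) e^{-j\omega\tau}\, du\, d\tau$$

where $s$ is the input signal, $t$ is the time, $\omega$ is the frequency, and $\sigma$ is the smoothing parameter
• Filtered/smoothed version of the Wigner distribution
• Amount of smoothing is controlled by $\sigma$
• Smoothing comes with a tradeoff of reduced resolution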
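
To make the role of the smoothing parameter concrete, the sketch below evaluates the Choi-Williams kernel in its usual ambiguity-domain form, exp(-(θτ)²/σ). The slides do not show this form, so it is an assumption here, as are the function name `choi_williams_kernel` and the sample values of θ, τ, and σ.

```python
import math


def choi_williams_kernel(theta: float, tau: float, sigma: float) -> float:
    """Ambiguity-domain kernel exp(-(theta * tau)^2 / sigma) (assumed form)."""
    if sigma <= 0.0:
        raise ValueError("sigma must be positive")
    return math.exp(-((theta * tau) ** 2) / sigma)


# Illustrative (hypothetical) evaluation point away from the axes, where
# cross-terms of a multi-component signal tend to sit.
theta, tau = 2.0, 1.5
kernel_values = {
    sigma: choi_williams_kernel(theta, tau, sigma)
    for sigma in (0.1, 1.0, 10.0, 1000.0)
}

for sigma, value in kernel_values.items():
    print(f"sigma={sigma:g}: kernel={value:.6f}")
```

Under this convention a small σ attenuates content away from the axes strongly (more smoothing, lower resolution), while a very large σ leaves the kernel close to 1, which approaches the unsmoothed Wigner distribution.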
Classification


• A Linear Discriminant Classifier (LDC) is used
• The discriminant function is defined as:

$$D_k(x) = x_1 a_{1k} + x_2 a_{2k} + \dots + x_N a_{Nk} + x_{1+N}\, a_{1+N,k}$$

where $a_{ik}$ are the trained coefficients for the $k$-th class and $x$ is the feature vector.
• A sample vector $x$ belongs to a particular class if the discriminant function is greater for that class than for any other class, i.e. $x$ belongs to class $C_j$ if

$$D_j(x) > D_k(x), \quad \text{for all } k \neq j$$
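
The decision rule above amounts to evaluating one weighted sum per class and taking the largest. The following is a minimal sketch under stated assumptions: the coefficient values are made up for illustration, the names `discriminant` and `classify` are hypothetical, and the last feature entry is fixed to 1.0 so that its coefficient acts as an offset (the slides do not say how the final term is defined).

```python
from typing import List, Sequence


def discriminant(x: Sequence[float], a_k: Sequence[float]) -> float:
    """D_k(x): sum over i of x_i * a_ik for one class k."""
    if len(x) != len(a_k):
        raise ValueError("feature and coefficient vectors must have equal length")
    return sum(xi * ai for xi, ai in zip(x, a_k))


def classify(x: Sequence[float], coeffs: List[Sequence[float]]) -> int:
    """Return the 1-based class index j with the largest D_j(x)."""
    scores = [discriminant(x, a_k) for a_k in coeffs]
    return scores.index(max(scores)) + 1


# Hypothetical trained coefficients: three classes, two features plus
# a constant entry (assumption: the last feature is always 1.0).
coeffs = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.5, 0.5, -1.0],
]
x = [0.2, 0.9, 1.0]

predicted_class = classify(x, coeffs)
print(predicted_class)
```

With these numbers the three discriminant values are 0.2, 0.9, and -0.45, so the sample is assigned to class 2.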
Prognosticator - Hidden Markov Model


• A statistical modeling method which assumes the states to be Markovian.
• States here correspond to discrete levels of fault severity.

• Finds the hidden variables from the observable parameters.

► Observable parameters: features extracted from sampled signals

► Hidden parameters: machine states

• The model parameters

► State transition probability matrix ($a_{ij} = p(x_{t+1} = j \mid x_t = i)$)

► State-dependent observation density ($b_j(y_t) = p(y_t \mid x_t = j)$)

► Initial state probability ($\pi_i = p(x_1 = i)$)
• Problem to be solved

► Given the observation sequence $y = (y_1, \dots, y_k)$ and the set of model parameters, choose the corresponding state sequence $x = (x_1, \dots, x_k)$ that is most likely to have generated the observation sequence.
Prediction Algorithm


• The normalized forward probability, $\delta_t(i)$, at time $t$ for each state $S_i$, and the state transition probabilities, $a_{ij}$, are used to predict the probabilities of the states at time $t + 1$. The probability of a transition to state $S_j$ at time $t + 1$ is given by

$$P[q_{t+1} = S_j \mid \lambda] = \sum_{i=1}^{N} P[q_t = S_i \mid \lambda]\, a_{ij} = \sum_{i=1}^{N} \delta_t(i)\, a_{ij}$$

where $\lambda$ is the set of model parameters.

• The most probable state at time $t + 1$ is the one with the highest probability.
• The predicted state probabilities are updated using the state-dependent observation probability $b_j(O_t)$ at each time step. The algorithm works as follows:


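• Initialize

$$\delta_1(i) = \pi_i\, b_i(O_1), \quad 1 \le i \le N$$

$$\psi_1(i) = 0, \quad 1 \le i \le N$$

• Recursion

$$\psi_t(j) = \arg\max_{1 \le i \le N} \left[\delta_{t-1}(i)\, a_{ij}\right], \quad 2 \le t \le T, \quad 1 \le j \le N$$

$$\delta_t(j) = \max_{1 \le i \le N} \left[\delta_{t-1}(i)\, a_{ij}\right] b_j(O_t), \quad 2 \le t \le T, \quad 1 \le j \le N$$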
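
The initialization, recursion, and one-step prediction can be sketched together as follows. This is an illustrative reading, not the authors' implementation: the three-state model, its numbers, and the function names are hypothetical, the recursion is taken to be a max-product (Viterbi-style) update, and renormalizing after every step is an assumption suggested by the term "normalized forward probability".

```python
from typing import List, Sequence, Tuple


def normalize(p: Sequence[float]) -> List[float]:
    """Scale non-negative weights so that they sum to one."""
    total = sum(p)
    if total <= 0.0:
        raise ValueError("weights must have a positive sum")
    return [v / total for v in p]


def predict_next(delta: Sequence[float], A: Sequence[Sequence[float]]) -> List[float]:
    """P[q_{t+1} = S_j] = sum over i of delta_t(i) * a_ij, for every j."""
    n = len(delta)
    return [sum(delta[i] * A[i][j] for i in range(n)) for j in range(n)]


def initialize(pi: Sequence[float], b_1: Sequence[float]) -> List[float]:
    """delta_1(i) = pi_i * b_i(O_1), then normalized (assumption)."""
    return normalize([p * b for p, b in zip(pi, b_1)])


def recurse(
    delta_prev: Sequence[float],
    A: Sequence[Sequence[float]],
    b_t: Sequence[float],
) -> Tuple[List[float], List[int]]:
    """One step: psi_t(j) = argmax_i, delta_t(j) = max_i[...] * b_j(O_t)."""
    n = len(delta_prev)
    psi: List[int] = []
    delta: List[float] = []
    for j in range(n):
        candidates = [delta_prev[i] * A[i][j] for i in range(n)]
        best = max(range(n), key=lambda i: candidates[i])
        psi.append(best)
        delta.append(candidates[best] * b_t[j])
    return normalize(delta), psi


# Hypothetical left-to-right model; the last state is the failure state.
A = [[0.7, 0.2, 0.1], [0.0, 0.6, 0.4], [0.0, 0.0, 1.0]]
pi = [0.8, 0.2, 0.0]
# b_j(O_t) for three successive observations (made-up likelihoods).
likelihoods = [[0.6, 0.3, 0.1], [0.2, 0.5, 0.3], [0.05, 0.25, 0.7]]

delta = initialize(pi, likelihoods[0])
failure_forecast = [predict_next(delta, A)[-1]]
for b_t in likelihoods[1:]:
    delta, _psi = recurse(delta, A, b_t)
    failure_forecast.append(predict_next(delta, A)[-1])

print([round(p, 4) for p in failure_forecast])
```

In this toy sequence the observations favor progressively worse states, so the forecast probability of the failure state rises at each step.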
State Dependent Observation Distribution B


Experimental method to obtain the distributions:

• The distributions of the projections of the observations are assumed to be Gaussian


• The training phase gives a set of LDC coefficients for each class (called an LDC plane)
• Training samples are projected onto all LDC planes

• The means and variances of the projections are used to define the state-dependent observation distribution
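
As a rough illustration of the last step, the sketch below fits a Gaussian to the projections of each class on a single LDC plane and evaluates the resulting density for a new projection. The projection values and function names are hypothetical, and the use of the sample variance is an assumption. The slides do not state how densities from several planes are combined, so only one plane is shown.

```python
import math
from statistics import mean, variance
from typing import Sequence, Tuple


def fit_gaussian(projections: Sequence[float]) -> Tuple[float, float]:
    """Mean and sample variance of one class's projections on one plane."""
    mu = mean(projections)
    return mu, variance(projections, mu)


def gaussian_pdf(y: float, mu: float, var: float) -> float:
    """Gaussian density N(y; mu, var) used as the observation density."""
    if var <= 0.0:
        raise ValueError("variance must be positive")
    return math.exp(-((y - mu) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


# Hypothetical projections of training samples from two fault classes.
class_a = [0.9, 1.0, 1.1]
class_b = [2.9, 3.0, 3.1]

mu_a, var_a = fit_gaussian(class_a)
mu_b, var_b = fit_gaussian(class_b)

# Density of a new projected observation under each class.
y_new = 1.05
b_a = gaussian_pdf(y_new, mu_a, var_a)
b_b = gaussian_pdf(y_new, mu_b, var_b)
print(b_a > b_b)
```

The new projection lies close to the first class mean, so its density under that class is far larger than under the second.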
State Transition (A) and Initial State (π) Probabilities


Both should result from extensive testing and aging/fatigue models.

• The state transition probabilities are computed using a heuristic method
• The probabilities are computed from a matching pursuit decomposition of the sampled data


• The initial state probabilities should come from

► The manufacturer

► Repair facilities

► Large-scale sample analysis

• Assumed values are used for the demonstration

• The prognosis algorithm was tested under the assumption that the machine already has traces of a fault

• The initial probability of class 2 is set high compared to the others.
State Dependent Observation Density Statistics - UDWT


• Means of the projections on each plane

C      P1        P2        P3        P4        P5        P6
1     0.6263    0.8329    1.6662    1.9190    3.0788    4.43410
2    -0.0668   14.2365   17.0266   20.4563   28.9979   43.0349
3    -0.0177   14.0670   17.3373   20.5170   29.3376   43.7851
4    -0.0268   13.2008   16.0605   22.1399   32.5496   48.0109
5    -0.0034   12.5658   15.4205   21.8770   32.9591   48.6149
6    -0.0110   12.5841   15.5384   21.8047   32.8978   48.6836

• Variances of the projections on LDC planes

C     P1        P2        P3        P4        P5        P6
1     0.2188    0.3748    0.6602    0.2251    0.1785    0.2058
2     0.0031    0.0241    0.0176    0.1725    0.0470    0.0425
3     0.0072    0.0812    0.0228    0.4906    0.0600    0.0500
4     0.0066    0.0299    0.0251    0.0873    0.0269    0.0583
5     0.0094    0.0168    0.0328    0.2754    0.0238    0.0626
6     0.0086    0.0302    0.0278    0.4548    0.0201    0.0584

► C is the class, P is the LDC plane
State Dependent Observation Density Statistics - WVD


• Means of the projections on each plane

C     P1        P2        P3        P4        P5        P6
1     6.8544    5.8337    5.0972    3.6628    2.3158    1.1945
2     6.8033    5.8776    5.1430    3.7109    2.3900    1.3013
3     6.7765    5.8629    5.1559    3.7687    2.4843    1.4219
4     6.5098    5.6555    5.0384    3.8567    2.7262    1.7653
5     5.5336    4.8971    4.4652    3.6653    2.8685    2.1619
6     3.4635    3.2352    3.1011    2.8799    2.6232    2.3641

• Variances of the projections on LDC planes

C     P1        P2        P3        P4        P5        P6
1     0.0286    0.0458    0.2820    4.7961    0.2259    0.0182
2     0.0284    0.0521    0.3000    4.6780    0.2371    0.0162
3     0.0268    0.0515    0.2849    4.5926    0.2294    0.0152
4     0.0233    0.1134    0.2013    4.7297    0.1701    0.0120
5     0.0137    0.2317    0.0988    3.5576    0.0948    0.0069
6     0.0033    0.2408    0.0153    1.2946    0.0203    0.0039

► C is the class, P is the LDC plane
State Dependent Observation Density Statistics - CWD


• Means of the projections on each plane

C     P1        P2        P3        P4        P5        P6
1     6.8545    5.9820    5.2338    3.8745    2.6148    1.5756
2     6.8426    5.9924    5.2388    3.8752    2.6176    1.5820
3     6.8364    5.9823    5.2477    3.9147    2.6789    1.6580
4     6.7027    5.8607    5.1847    3.9623    2.8096    1.8467
5     6.2210    5.4443    4.8794    3.8640    2.8811    2.0450
6     5.0197    4.4060    4.0439    3.4009    2.7384    2.1516

• Variances of the projections on LDC planes

C     P1        P2        P3        P4        P5        P6
1     0.0218    0.0554    0.2132    3.5857    0.1953    0.0143
2     0.0215    0.0584    0.2179    3.5532    0.1981    0.0140
3     0.0212    0.0522    0.2066    3.5301    0.1898    0.0136
4     0.0197    0.0427    0.1680    3.4105    0.1592    0.0120
5     0.0160    0.0502    0.1116    2.9207    0.1113    0.0089
6     0.0096    0.0891    0.0426    1.9052    0.0464    0.0046

► C is the class, P is the LDC plane
Plots of the State Dependent Observation Density Statistics

Axis label: Probability


Figure:  UDWT 


Figure:  Wigner 


Figure:  Choi-Williams 
HMM Parameter Values - State Transition Probability Matrix


• The sample mean of each class was calculated
• Matching pursuit (MP) decomposition was performed

► A greedy adaptive algorithm

► Chooses the atom of the dictionary that best represents the signal

► Decomposition was performed with a Gabor dictionary

►  Uses  Gaussian  window  for  atom  generation  g(t )  =  e~  2f 

► 3905 normalized atoms were generated by time shifting, scaling, and modulation

$$g_\gamma(t) = \frac{k_\gamma}{\sqrt{s}}\, g\!\left(\frac{t - \tau}{s}\right) \cos(\xi t + \phi)$$

where $s$ is the scaling constant, $\tau$ is the time shift, and $k_\gamma$ is the normalizing coefficient

• From the decomposed samples, the state transition probabilities are calculated
• Only forward-path probabilities were allowed, as the fault analyzed is non-reversible


Class    1         2         3         4         5         6
1        0.5063    0.2435    0.1481    0.0800    0.0200    0.0021
2        0         0.4935    0.2127    0.1651    0.0967    0.0320
3        0         0         0.4542    0.2709    0.1535    0.1214
4        0         0         0         0.4529    0.2900    0.2571
5        0         0         0         0         0.5800    0.4200
6        0         0         0         0         0         1.0000
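
The greedy matching pursuit step listed on this slide can be sketched as below. This is a toy version under stated assumptions: the dictionary is far smaller than the 3905 atoms used in the work, the Gaussian window is assumed to be exp(-π u²), and all names and parameter grids are hypothetical. How the transition probabilities are then derived from the decomposition is not described in the slides and is not attempted here.

```python
import math
from typing import List, Sequence, Tuple


def gabor_atom(n: int, s: float, tau: float, xi: float, phi: float = 0.0) -> List[float]:
    """Sampled real Gabor atom, scaled to unit Euclidean norm."""
    raw = [
        math.exp(-math.pi * ((t - tau) / s) ** 2) * math.cos(xi * t + phi)
        for t in range(n)
    ]
    norm = math.sqrt(sum(v * v for v in raw))
    if norm == 0.0:
        raise ValueError("atom has zero norm on this grid")
    return [v / norm for v in raw]


def build_dictionary(n: int) -> List[List[float]]:
    """Small illustrative dictionary: scaling, time shifting, modulation."""
    atoms = []
    for s in (2.0, 4.0, 8.0):
        for tau in range(0, n, 4):
            for xi in (0.0, math.pi / 8, math.pi / 4, math.pi / 2):
                atoms.append(gabor_atom(n, s, float(tau), xi))
    return atoms


def energy(v: Sequence[float]) -> float:
    return sum(x * x for x in v)


def matching_pursuit(
    signal: Sequence[float], dictionary: List[List[float]], n_iter: int
) -> Tuple[List[Tuple[int, float]], List[float]]:
    """Greedy MP: repeatedly remove the best-correlated atom from the residual."""
    residual = list(signal)
    picks: List[Tuple[int, float]] = []
    for _ in range(n_iter):
        inner = [sum(r * a for r, a in zip(residual, atom)) for atom in dictionary]
        k = max(range(len(dictionary)), key=lambda i: abs(inner[i]))
        picks.append((k, inner[k]))
        residual = [r - inner[k] * a for r, a in zip(residual, dictionary[k])]
    return picks, residual


n = 32
atoms = build_dictionary(n)
# Synthetic test signal built from two dictionary atoms (hypothetical).
signal = [3.0 * a + 1.0 * b for a, b in zip(atoms[10], atoms[70])]
picks, residual = matching_pursuit(signal, atoms, 5)
print(picks[0][0], energy(residual) / energy(signal))
```

Each iteration removes the projection of the residual onto one unit-norm atom, so the residual energy cannot increase from one iteration to the next.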
HMM Parameter Values - Assumed Initial State Probabilities (π)


Historical data and manufacturer input were not available.
Assumed values were used to demonstrate the developed method.


Class          1       2       3       4       5       6
Probability    0.07    0.60    0.15    0.1     0.08    0
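
As a worked check on these parameter values, the sketch below propagates the assumed initial probabilities one step through the state transition matrix, before any observation is taken into account. It assumes that rows of the matrix index the current class and columns the next class, which the zero lower triangle and the forward-only restriction suggest. The variable names are illustrative.

```python
# State transition matrix A as tabulated in the slides
# (assumption: row = current class, column = next class).
A = [
    [0.5063, 0.2435, 0.1481, 0.0800, 0.0200, 0.0021],
    [0.0, 0.4935, 0.2127, 0.1651, 0.0967, 0.0320],
    [0.0, 0.0, 0.4542, 0.2709, 0.1535, 0.1214],
    [0.0, 0.0, 0.0, 0.4529, 0.2900, 0.2571],
    [0.0, 0.0, 0.0, 0.0, 0.5800, 0.4200],
    [0.0, 0.0, 0.0, 0.0, 0.0, 1.0000],
]
# Assumed initial state probabilities from the slides.
pi = [0.07, 0.60, 0.15, 0.10, 0.08, 0.0]

n = len(pi)
row_sums = [sum(row) for row in A]

# One-step prediction: P[next = j] = sum over i of pi_i * a_ij.
predicted = [sum(pi[i] * A[i][j] for i in range(n)) for j in range(n)]
failure_probability = predicted[-1]

print([round(s, 4) for s in row_sums])
print(round(failure_probability, 6))
```

Every row of the matrix sums to one, and with these values the one-step probability of reaching class 6 is about 0.097, with the largest contributions coming from classes 4 and 5.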
Probability of the Failure State ($C_6$)

An artificial sequence of observations was constructed to test the method. These observations were actual sampled signals arranged in order of severity to mimic a natural fault progression.


Figure: Failure probability versus observation number, with one panel each for UDWT, WVD, and CWD
Results


• State-dependent observation probabilities are higher when the projections are made on the corresponding plane

• The values of $b_j$ computed using the UDWT are less discriminative, except for the healthy class ($C_1$), than the $b_j$ obtained using the Wigner or Choi-Williams distributions

• Although for the Wigner and Choi-Williams distributions the probabilities are close for the early fault severities, they are adequately discriminative for the high fault severities.

• The values of $b_j$ computed using features extracted from the Choi-Williams distribution are slightly more discriminative than those from the Wigner distribution, due to the smoothing provided by the Choi-Williams distribution.

• The probability of failure, class 6, increases as the current fault severity increases, which is in accordance with the expected results.
Conclusions


• A fault prognosis method based on the Hidden Markov Model was developed and demonstrated

• Parameters of the HMM were calculated through a mixture of experiments and heuristic methods

• The HMM is used to estimate the most probable next state at every time step, as well as the probability of failure

• The prognosis algorithm produces similar failure probabilities for all three distributions used